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Abstract 

Consider a random non-centered multiple antenna radio transmission channel. Assume 
that the deterministic part of the channel is itself frequency selective, and that the random 
multipath part is represented by an ergodic stationary vector process. In the Hilbert space 
P(Z), one can associate to this channel a random ergodic self-adjoint operator having a 
so-called Integrated Density of States (IDS). Shannon’s mutual information per receive 
antenna of this channel coincides then with the integral of a log function with respect to 
the IDS. In this paper, it is shown that when the numbers of antennas at the transmitter 
and at the receiver tend to infinity at the same rate, the mutual information per receive 
antenna tends to a quantity that can be identified and, in fact, is closely related to 
that obtained within the random matrix approach m- This result can be obtained by 
analyzing the behavior of the Stieltjes transform of the IDS in the regime of the large 
numbers of antennas. 


1 Introduction and problem statement 

In the landmark papers by Foschini and Cans m and by Telatar 133 the great promise of the 
use of multiple transmit and receive antennas was presented and established. The importance 
of such Multiple Input Multiple Output (MIMO) links is based on the fact that parallel data 
streams emanating from different transmit antennas can be decoded simultaneously from 
the receive array, thus making the throughput scale linearly with the number of transmit 
antennas. This scaling property is important in the quest to meet the expected thousand¬ 
fold increase of wireless network capacity in the coming years [1] . At the same time, the 
necessary transmit power per data stream is reduced by the same factor, thus also addressing 
energy-related pollution issues, which are becoming major societal and economical concerns 
miin]. In fact, so-called massive MIMO systems of a few hundred antenna arrays |26l [35] 
have been identified as a key enabling technology for the next generation 5G wireless networks 
[3], promising unprecedented data transfer increases. 

In this paper, we address the calculation of Shannon’s mutual information of MIMO 
channels under some general assumptions on the channel statistics who have been considered 
only partially in the literature. To start with, we shall assume that the channel is subjected 
to time correlations that are more realistic than the commonly assumed multi-block-fading 
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model HIE]. A second assumption on the model is related to the so-called frequency selectivity 
of the channel due to the delay spread induced by the reverberations of the signal from 
buildings, walls, etc. Typically, this effect is modelled using a tap-delay line system with 
different tap variances laEj. A third component of the channel model is related with the 
so-called Ricean (or “deterministic”) part, due to e.g. a line of sight or a specular component. 
In this work, we consider in most generality that the deterministic part of the channel is also 
frequency selective (Hj. 

To be more specific, let us denote by T the number of antennas at the transmitter and 
by N the number of antennas at the receiver. Then the C^-valued signal received at time k 
according to our model is 

L 

Y{k)= H{k,i)S{t) + V{k) 

e=-L 

where the processes {S{k)}kez and {V{k)}k^z represent respectively the C^-valued input sig¬ 
nal fed to the channel and the C^-valued additive noise. The channel with 2L-I-1 matrix coef¬ 
ficients is represented valued random process {H{k) = [H{k,k — L),... ,H{k,k + 

L)]}fcgz, assumed to be Gaussian, stationary, ergodic, and generally non-centered. It is 
assumed that the processes {5(fc)}, {V{k)} and {H{k)} are mutually independent, and 
that {5(A:)} (respectively {V{k)}) is an independent process such that S{k) ~ CM{0,It) 
(resp. V{k) ~ CM{0, In)) for any /c G Z. 

As is well-known, the Doppler effect induced by the mobility of the communicating ter¬ 
minals determines the form of the covariance function of the process {H{k)}. The multipath 
effect that induce the frequency selectivity is captured by the 2L -|- 1 matrix channel coeffi¬ 
cients which are subjected in the practical situations to a certain power profile. In addition, 
the fact that Eii'(O) can be non zero and can be also frequency selective is due e.g. a line 
of sight or a specular component. We shall formulate our assumptions on the channel more 
precisely in Section below. 


The signal = [Y{—n)'^,...,Y{n)'^]'^ observed during the time window (—n, ...,n) 
satisfies the equation Y'^ = where = [V{—n )'^,..., V{n)'^]'^, 5” = [S{—n — 

L)^,..., S{n + L)]'^, and 


= 


H{—n,—n — L) ■■■ H{—n,—n + L) 


H{n,n — L) 


H{n,n + L) 


( 1 ) 


is a (2re -|- 1)A^ x (2n -|- 2L -I- 1)T matrix. Our goal is to study the mutual information of 
this channel assuming that it is perfectly known at the receiver. More precisely, we study 
I{S] {Y, H)), the mutual information between {^(/c)} and the couple {{Y{k)}, {H{k)}). It is 
given by the equation 

1(5; (y, H)) = hmsup (Y^, H^)) 

n 2n + l 


where 

1(5”; (y”, H^))=E log det{H^H^* + I) 

is the mutual information between 5” and {Y"',H^) (see [HI Chap. 8], [37j and Section]^ 
below). 
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We shall see that the natural way to tackle this problem is to consider the MIMO channel 
as a random operator represented by the doubly infinite matrix H = and acting 

on the Hilbert space of the doubly infinite square summable sequences. Due to the 

ergodicity of {H{k)), this operator that we also denote as H is ergodic in the sense of [30] . 
By the ergodicity, it turns out that the self-adjoint operator HH* where H* is the adjoint 
of H has a so-called Integrated Density of States (IDS). Specifically, there exists a positive 
deterministic measure p, of total mass one such that on a probability one set, 

—Tr5(M"*) ^ [ 9{X)p{dX) 

{In + IjiV n^oo J 

for any continuous and bounded real function g. The IDS is the distribution function of p. The 
crucial observation is that this convergence leads to the convergence of I{S^] (D"", H'^))/{2n + 
I), and the limit is 

I{S- {Y, H)) = N j log(l + A) p{dX) 

as it will be shown below. Our goal is then to study the behavior of the integral at the right 
hand side. Unfortunately, a few can be said about this behavior in the general situation. To 
circumvent this problem, one needs to resort to a certain asymptotic regime. 

In this paper, consistently with an established practice in the evaluation of the mutual 
information of MIMO channels, we consider an asymptotic regime where the numbers of 
antennas N and T tend to infinity at the same rat^ A central object of study will be the 
Stieltjes transform of the measure p, that is the complex analytical function 

m{z) = I j^KdX) 

on the upper half-plane C+ = {z : ^2 > 0}. This function completely characterizes p and is 
intimately connected with the resolvent 

Q(z) = {HH* - zl)-^ 

of the operator HH*. Specifically, considering the matrix block representation Q{z) = 
[Q{k, i){z)]k,i£z of the resolvent where the blocks Q{k,i){z) are N x N complex matrices, it 
holds that 

TrEQ(0,0)(z) 
m(z) = --. 

The core of our analysis consists in studying the behavior of the right hand side of this relation 
in the large N, T regime. Re-denoting p and m{z) as pT and mT{z) to stress the dependency 
on T, it turns out that there exists a sequence of probability measures tvt which approximates 
Pt in the sense that 

rriTiz) — / -- TVTidX) -)• 0. 

J X — z r->-oo 

In addition, we can associate to each ttt a positive number It of order one for large T, and 
such that 

Li{S-{Y,H))-It^0. 

iV r->-oo 

^We note that other asymptotic regimes can also be studied by the techniques of this paper, for instance 
the one where N and T are fixed and where L tends to infinity. 
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Thus, the mutual information increases at a linear scale with the number of antennas at one 
side of the transmission. The Stieltjes transform of ttt as well as It satisfy a system of equa¬ 
tions that can be easily implemented on a computer. Solving for It, one can evaluate the 
impact of the various statistical parameters of the channel model on the mutual information. 

In the literature dealing with the ergodic mutual information of MIMO channels, it has 
been frequently assumed that the channel was a frequency non selective channel which sat¬ 
isfies the simplifying block-fading assumption (see e.g. [37j). In the language of the ergodic 
operator theory, the operator HH* corresponding to such channels is a block diagonal oper¬ 
ator with independent and identically distributed diagonal blocks, and the existence of the 
IDS is immediate. In order to approximate the Stieltjes transform of ^lt in the regime of the 
large number of antennas, one only needs to study the behavior of 

^TrEQ(0,0)(z) = ^TTE{H{0,0)Hi0,0r - zIn)-^. 

The large N, T behavior of this object can in fact be studied with the help of random ma¬ 
trix theory techniques, without making any explicit reference to ergodic operators. In this 
framework, a number of works studied N~^ Tr E(5(0,0)(z) with more and more sophisticated 
statistical model for H{0,0). Among these, centered models with the so-called single-sided 
or double-sided correlations were studied in [IIlETlEaE]. A more general centered model 
was considered in [38]. Non-centered models were considered in [281 [36l [T3| among others. 
One contribution of our paper is that we consider frequency selective channels and a model 
for the Doppler effect more realistic than the block-fading model. 

Ergodic operator theory (see e.g. |30|) has aroused a considerable interest in the fields of 
operator theory, quantum physics, and statistical mechanics where these operators are fre¬ 
quently used to model the Hamiltonians of randomly disordered systems. In this framework, 
the contribution |24j studied the asymptotic behavior of the IDS of certain random Hermitian 
operators when some design parameter is made converge to infinity. For the models consid¬ 
ered in [231, it is shown that the limiting IDS coincides with that of a deformed large Wigner 
matrix, well known in random matrix theory. The connections between ergodic operator 
theory and random matrix theory are further explored in m- In this paper, we adopt the 
general approach of MM, dealing with random operator models a la Marchenko-Pastur. 
In order to perform our asymptotic analysis, we rely on two tools that belong to the arsenal 
of random matrix theory, as shown in |32| and the references therein: an integration by parts 
formula for the expectation of functions of Gaussian vectors (already used in |24)). and the 
Poincare-Nash inequality to control our variances. 

In the communication theory literature, the ergodic operator formalism for studying the 
mutual information of multi-antenna channels was used in [211 122] . These contributions 
brought to the fore the relation between the mutual information and the IDS without elabo¬ 
rating on the properties of the latter. 

In the next section of this paper, we formulate precisely our assumptions and we state our 
results. These results are illustrated in Section]^ by some computer simulations. An overview 
of the ergodic operator theory is provided in Section [^ The proofs of the main results are 
provided in Sections and 
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2 Assumptions and the results 

We start by introducing the exact statistical model of our channel. Since the number of 
antennas will tend to infinity, the channel will be described by a sequence of random operators 
on indexed by the parameter T. 

Given a probability space (fl, P) and two sequences of positive integers {N{T))T£n and 
{L{T))t£Ni consider the sequence of random operators 

Hx = A.X + Xx 


with domain 'D{Hx) = /C, the manifold of vectors of P{1^) with finite support. The operator 
Ax is a deterministic operator whose matrix represented in the canonical basis of by 

the matrix Ax = [Ax{k — i)]k/£z where the block Ax{k) is a N{T) x T complex matrix, and 
where the sequence (..., 1) At(0), ... is supported by {—L(T),..., L{T)}. Since 

the blocks of Ax are identical along the diagonals. Ax is a convolution operator [7]. 

The operator Xx is a /C-defined random operator on (in the sense of [301 Sec. I.l.B], 

see Section below) whose matrix representation in the canonical basis of /^(Z) is the band 
matrix 


Xx 


■■ ^^(O,—1) X'r(0,0) Xr(0,1) 

0 

[Xx{k,e),k,ie'L,\k- <L{T)] . 


The N{T) X T random complex matrix Xx{k,£) is written as 


0 


Xx{k,£) = [Xx,nAkJ),n = 0 : {N{T) - l),t = 0 : {T - 1)] 


and satisfies 

Xx{k, i) = . _ (f)xik — l)Wx{k, t). 

vT 

Here : Z —)• [0, oo) is a real valued function supported by the set {—L(T),..., L(r)}, and 
{Wx{k,£) = [Wx,n,t{k,i),n = 0 : {N{T) — l),t = 0 : (T — l)])fc,£ez is a complex Gaussian 
circularly symmetric centered random field such that 

nWx,n,,tAkl,h)WT,n,,tAk2j2)] = ^ni ,n2 ^ti,t2 —£i,k2 — t2lT{kl - k2) 

where the covariance function 'yxik) satisfies 7 t (0) = 1 without generality loss. 

The operator Ax models the deterministic part of the channel. The function (jjx models the 
multipath amplitude profile, while the covariance function ^x is related to the Doppler effect. 
As is well known, the function 'yxik) converges quickly to zero when the mobile terminal 
velocity is high. 

The following set of assumptions will be needed: 

1. 0 < liminf N{T)/T < sup N{T)/T < oo. We write c = sup^^ A'(T)/T. 

T^oo X 

2. ax = (t>T{£A satisfies 0 < liminf^ a"^ < sup^^ cj|, < oo. We let cr^ = sup-j^ a'^. 

i 
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3. g = sup V |7 t( 

^ i 


4. a = sup 

T 


L(T) 

E 

e=-L(T) 


is finite. 


||747’(f’)|| is finite, where || • || is the spectral norm. 


The asymptotic regime described by the above assumption will be concisely denoted “T ^ 
oo”. Notice that there is no bound on the number of channel coefficients 2L{T) + 1. 

Let us comment these assumptions. The parameter in Assumption is the part of 
the received power due to the centered part of the channel. The practical interpretation of 
Assumption is that the so-called coherence time of the channel [5] does not grow with T. 
At this point, we stress the importance of the fact that our results on the mutual information 
assume the channel to be perfectly known at the receiver. In practice, this channel needs to be 
estimated, and this task will be getting harder as T grows if the coherence time is kept fixed 
(see however the discussion around Fig. below). We believe that relaxing Assumption]^ 
requires other mathematical tools than those used in this paper. Finally, Assumptioncould 
be certainly weakened and replaced with e.g. a bound on the Euclidean norms of the columns 


and the rows of the matrices A{t) at the expense of a more involved proof for Theorem 2.3 
below. We also point out that the convergence rates obtained in the proof of this theorem 
can be improved. We chose to keep Assumption and the present proof for simplicity. Note 
also that we assume the Gaussian distribution of various random variables processes above 
largely for the simplicity of the corresponding proofs. By using a version of the so-called 
interpolation trick (see [32l Sections 18.3-18.4]), our results can be extended to the case 
where the corresponding ergodic random processes are not necessarily Gaussian just having 
the same covariance and with a certain number finite moments and sufficiently fast decaying 
correlations, although the respective proofs of the results become rather involved. 


As mentioned in the introduction, the Stieltjes transform of a probability measure v on 
M plays a fundamental role in this paper. This is the function 

= J 

defined on C+. This function is i) holomorphic on C+, ii) it satisfies 5 ( 2 ;) G C+ for any 
z G C+, and in) limy_,.oo(—*y)s(^?/) = 1. In addition, if u is supported by M+ = [ 0 , 00 ), 
then iv) ‘A{zs{z)) > 0 for any 2 ; G C+. Conversely, any function s{z) satisfying i)-iv) is the 
Stieltjes Transform of a probability measure supported by M+ [2^. Observe that the Stieltjes 
Transform of v can be trivially extended from C+ to C —supp(z 2 ) where supp(i^) is the support 
of n. Finally, “probability measure” can be replaced with “positive measure n such that 
0 < z/(M) < 00 ” in the preceding statements if we replace Hi) with limy_,.oo(—*y)s(*y) = z^(M). 

We shall now state our results. Due to the fact that 7 t is summable, the Gaussian 
stationary random process 

{HT{k) = [Hrik, k - L{T )),..., Hrik, k + L{T))]}k^z 

is ergodic. This makes the operator Ht ergodic in a sense made precise below. The char¬ 
acterization of the mutual information in the framework of ergodic operators is given by the 
two following results. 
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Proposition 2.1. On a probability one set, the operator Ht is closable. Still denoting by Ht 
the closure of this operator and by its adjoint, the positive self-adjoint operator HtH^ is 
ergodic and has an IDS defining a probability measure fiT- Denoting by Qt{z) = [HTHif — 
zl)~^ = [QT{h,t}{z)\k/&'L the resolvent of for z G C+, where the blocks QT{k,£){z) 

are N x N matrices, the Stieltjes transform of p,T is mT{z) = A^“^ETr(5T(0,0)(2;). 

Theorem 2.1. The sequence {I{S'^] , H'^))/{2n-\-1)} converges as n ^ co, and its limit 

is the mutual information IriS; (Y, H)) = N J log(l + A) firidY) < oo. 

Let 

7t(/) = ^exp(2z7r/c/)7r(A:) 

k 

be the Fourier transform of the sequence {jTik)}, and 


^r(/) = ^exp(2z7rA:/)ylr(fc) 


be the N xT Fourier transform of the sequence {AT{k)}. Write {ATAf){f) = AT{f)Af{f) 
and (A^Ayif) = Alf{f)AT{f) for compactness. 

Theorem 2.2. Let STif,z) and ST{f,z) be respectively the N x N and T xT matrices 


^TifiZ)— —2:(l + iTy72’(/) * ^r(/, + (l + <Ty72’( —/) * ¥5'r(/, z)) {ATAjI){f) 


1-1 


( 2 ) 


Srif^z)— —2:(l + iTf’72’(—/) * 2:))/r + (l + o't7t(/) *‘Ft(/) ^)) 


1-1 


( 3 ) 


and 


z) 

7t(-/)*<Ft(/> z ) 



u)ipj^{u, z) du, and 


f)ipT{u,z)du. 


Then for any z G C+, the system of equations 


T>Tif^z) 


TvST{f,z) 

T 


and tprp(f,z) 


TTSTif,z) 

T 


admits a unique solution z),(prp(^-^ z)) such that ^pj,{-,z),(f:>j,{-,z) : [0,1] —)• C are both 

measurable and Lebesgue-integrable on [0,1] and such that Q(p{f,z), Is(p{f,z), Q{z(p{f, z)) 
and ii{z(p{f, z)) are nonnegative for any f G [0,1]. 

The solutions z) and z) are continuous on [0,1], and Is(p{f, z), ^ip{f, z), Q{z(p{f, z)) 
and Is{z(p{f, z)) are positive for any f G [0,1]. Furthermore, for any f G [0,1], the functions 
{T/N)(p{f, z) and tp{f, z) are defined on C —M+ and are the Stieltjes transforms of probability 
measures supported by [0,oo). 
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This system of equations turns out to be formally close to that described in [THl Th. 2.4], 
The uniqueness established in the present paper is a point-wise uniqueness (i.e., for any 
z £ C+) which is stronger than the type of uniqueness shown in [18]. The proof of Theo¬ 
rem is given in Section]^ 


The discussion preceding Proposition 2.1 shows that the function 

Pr(^) = ^^ 'TtSTif,z)df 

is the Stieltjes transform of a probability measure carried by [0, oo). We denote it ttt- 
Theorem 2.3. For any z G C+, 


mTiz)-pT{z)- -^0. (4) 

T^oo 

Moreover, the sequences {/^r} and {t^t} are tight, and 

j gWhridX) - j g(A) nridk) 0 

for any continuous and bounded real function g. 

The approximation of the mutual information for large T is finally provided by the fol¬ 
lowing theorem. 

Theorem 2.4. We have 

{Y,H))-Zt -^0, 

T^oo 

where 

It = j^og{l + X) TVTidX). 

and the integral is given by 

I logdet((l + 47 t(/) * -I))d7 + “f 

+ '^j^ log(l-Fc7|.7-r(-/)*(^7,(/,-l))d/ 

c^t7t(/ - v)iPt{v, -1)<Pt(/> -1) dvdf. 

Before turning to the numerical illustrations of these results, two remarks are in order. 
We first observe that the variance profile represented by the function : Z —)• [0, oo) has no 
influence on It except through the total received power due to the random part of the 
channel. 

We also observe that if the channel is centered, i.e., if At = 0, then tvt is the Marchenko- 
Pastur distribution 

= 27rcT^|. ~ ~ dX + ((1 - c],^) V 0)6o 
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where ct = N/T, A+ = a^{l + and A_ = <t^{1 — y/cr)^. It is indeed well known 

that the Stieltjes Transform of ttt is cjf^a{z), where a(z) is the unique solution in C+ of the 
equations 

a{z) = cn{—z — za‘^a{z))~^, a{z) = {—z — za^a(z))~^. 

Recalling that f 'yj'i f) = 1) one can immediately check that when At = 0, the couple 
{iPTif,z),(pTif^^)) — ■: ^{^)) is a solution of the system described in the statement of 

Theorem |2.2[ The result follows by the uniqueness of this solution. 


3 Numerics 


In this section we will present the behavior of the mutual information for some representative 
cases and will compare with numerically generated instantiations. To begin with we present 
the usually accepted model for the temporal correlation of fast fading, he., the so-called Jakes 
model [20], which, in the time domain has the following correlation form: 


7r(t) 


Jq 


2T:vt\ 


which in the frequency domain becomes 

1 1 

within the region |/| < and zero elsewhere, where fd = v/\t, the ratio of the velocity 
of the mobile to the wavelength times the time duration of the channel usage. It should be 
noted that the discontinuity at fd results to 7r(t) not being absolutely summable, and hence 
strictly speaking it cannot be used in this paper. However, demanding the frequency response 
to be continuous at f = fd, (by adding a small rounding factor in the frequency domain), we 
can make this to be an acceptable model for the system. For simplicity we will not include 
this in the simulations. 

To move on, we need to also present a model for the deterministic matrix function A'r(/). 
The typical situation for wireless communications is that the constant matrices are due to line- 
of sight rank-one components. Along these lines we assume that each of the time resolvable 
paths have the following matrix elements 

AT{k)m,n = exp [-\k\C/Ltot] exp [27rj(m - n) sin6»fc] /Vn 



where 6k = kn/Ltot, for k = —L,... ,L and Ltot = 2L + 1. The exponential dependence on 
the delay spread ^ has been seen experimentally |33ll9|. Then Axif) follows from 

L 

Arif) = ^T{k)exp[-27rj£f] 

e=-L 


Note that the rank of the above matrix is L^ax- 

In the next figures, we observe the behavior of the mutual information as a function of 
the Signal to Noise Ratio (SNR) parameter 

p = al + ^ j\x{ATA*r,){f)df 
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in various cases. In Fig. [^we plot the mutual information for different values of the Ricean 
coefficient K defined as the ratio between the power of the deterministic part channel to that 
of the of the random part of the channel for the case. We see that for increasing values of K 
the mutual information initially increases because the deterministic channel provides higher 
throughput, but then decreases because for higher K the rank-deficiency of the deterministic 
channel becomes apparent. 

In Fig. we see that the mutual information decreases with increasing inverse delay 
spread which is essentially the frequency correlation bandwidth. This behavior is expected, 
because the smaller ^ is the smaller the attenuation and thus the higher the strength of the 
channel at the receiver. 

We have also found that the dependence of the mutual information on the value of fd 
is rather benign and not discernable for the above type of deterministic channel. This can 
be attributed to the fact that the dependence of the eigenvalue distribution of (A'rA^)(/) 
on the frequency value / is quite small. For other less realistic examples of deterministic 
channels, e.g., the case where A{k) = exp(— |fc|^)/Ar, where In is the identity matrix, does 
depend strongly on fd- 

Finally, in Fig. we compare the results obtained from the methodology presented in 
this paper with numerically generated values. In this figure we plot the average mutual 
information generated numerically for various block sizes (here depicted by M). In this 
case for simplicity, we have used the exponentially decaying temporal correlation model,i.e., 
with 7r(^) = exp(—IA:|/rf). This model is easier to implement numerically and absolutely 
summable, but is not very realistic because it is very wide-band. The first important ob¬ 
servation here is that this approach gives near exact results for antenna arrays as small as 
N = T = 2 presented in this figure. Second, we see that depending on the value of fd the con¬ 
vergence to the asymptotic result depends on the actual block size. For small fd (temporally 
correlated channels) the M = 5 set of points (diamonds) is significantly deviating from the 
analytic curve, while for large fd the M = 5 simulations are much closer to the asymptotic 
result. Increasing the block size to M = 40 makes the numerical values right on the analytical 
ones. 

4 An overview of ergodic operators. Proofs of Proposition |2.1| 
and Theorem 12.11 

To make this paper reasonably self-contained, we provide a brief review of the ergodic operator 
theory that will be useful to our purpose. In passing, we shall prove Proposition |2.1[ For a 
comprehensive exposition of this theory, the reader may consult the book m- 

Since the operator Ht is defined for every cj G on the dense linear subspace JC of P (Z), 
and since the vector Hto. is a random vector in Z^(Z) for any a G /C, the operator Ht is 
random, following the definition of |30l Sec. I.l.B]. 

For the presentation simplicity, let us temporarily assume that N = T = 1 and suppress 
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Normalized Mutual Information vs SNR; fp=0.25: ^=1; N=T=10; L=5 



Figure 1: Mutual Information versus p for various K 


the index T. Since 1C is dense, H has an adjoint H* with domain 
V{H*) = < oo'^ 

k 

L 2 

= |o = ^afcefc : y~]| H{k + £,k)ak+e < ooj 


fcez 


fcez i=-L 


where ek is the canonical basis vector of /^(Z). Clearly tC C so this domain is 

dense in /^(Z). Therefore, H is closable, and its closure coincides with H**. In the remainder, 
we reuse the notation H to designate this closure. Note that with this extension, the set 1C 
becomes a core for H IMl Chap. VIII]. Since H is closed and densely defined, the operator 
HH* is a positive self-adjoint operator with 1C as a, core [21 Th. 46.2]. 


We shall now consider the ergodic properties of HH*. Making explicit the dependence on 
the elementary event a; in the definition of the process 


H{co) = {Hiio, k)}k^z, H{io, k) = [H{k, k-L),..., H{k, k + L)], 
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4.5 


Normalized Mutual Information vs SNR; K=1: f^=0.25: N=T=10: L=5 



Figure 2: Mutual Information versus p for various ^ 

it is easy to see that the summability of 7 implies that the shift i? : —)• defined by the 
equation H{Buj,k) = H(uj,k + 1) is ergodic. Moreover, the random operator H[oj) clearly 
satisfies the equation 

H{Buj) = UH{uj)U-^ 

where U is the (unitary) shift operator Ua = Ylk^k+i^k for o = Ylk^k^k £ An 

operator on that satishes such an equation where B is ergodic is said ergodic in the 

sense of 1301 page 33]. Now, writing {HH*){oj) = H{oj)H*{uj), we also have {HH*){Boj) = 
U{HH*){uj)U~^. In addition, it is clear that UK, = K. Therefore, the operator HH* is also 
ergodic in the sense of |3Q1 page 33]. 

Consider now the sequence of hnite dimensional matrices obtained by trun¬ 

cating the matrix HH* and keeping the elements (i, j) such that |i|, \ j\ < n, see Eq. 0. Let 
X-n,n, • • ■) Xn,n be the eigenvalues of and let 

1 

=- y 5x- 

^ ^ 2 n-bl ^ 

i=—n 

be the Normalized Counting Measure (NCM) of this matrix. Since the operator HH* is 
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Nonnalized Mutual Information vs. p; K=l; N=T=2; ^=0 



Figure 3; Mutual Information versus p for various L and block sizes 


represented by a band matrix, we have the following result, stemming directly from m 
Th. II-4.8]: 

Proposition 4.1. The operator HH* has an IDS. In other words, there exists a deterministic 
probability measure p, on [0, oo) such that 


[ g{\)u-{dX) [ gi\)p{dX) 


(5) 


for all continuous and bounded functions g. Moreover, the Stieltjes transform of p coincides 
with EQ(0, 0)(z). 


Proposition 2.1 for iV = T = 1 follows. It will be useful to outline the proof of Proposi¬ 
tion |4?^ The idea is to establish the convergence in the statement of this proposition for a 
large enough family of continuous functions. Given a continuous function g, start writing the 
left hand side of Eq. (j^ as 


g{X)v'^{dX) = 


1 


2 n + 1 


^ g{H^H^*){i,i) 
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where is the element {i,i) of the matrix g{H^H^*). Suppose that for large 

n, the operator at the right hand side of this equation can be replaced with HH*. Denote 
by g{HH*)(i,i) the {i,i) element of the matrix representation of g{HH*). Observe now that 
since HH* is ergodic, then by m Theorem 2.7], its resolution of the identity E\ is an ergodic 
projection for every A G M. As a result, the operator g{HH*) = f g{X)dE\ is also ergodic. 
Thus, 


1 ” 
i=—n 


1 

= 2n + l ^ {ei,g{HH*){u)ei) 

i=—n 

n 

= E {eo,U^9{HH*){u,)U-Ho) 

i=—n 

n 

= Y.(^^^9{HH*){B^u;)eo) 

i=—n 

-^Eg{HH*){0,0)=E [ g{X){eo, E{dX)eo) 

n^oo J 


by the ergodic theorem, provided the (0, 0) element g{HH*){0, 0) of g{HH*) is integrable. It 
follows that fi{dX) = E{eo, E{dX)eo). 

By taking g{X) = (A — z)~^ where z G C+, we get that the Stieltjes transform of g, coincides 
with Eg{HH*){Q, 0) = EQhh* (0, 0 )( 2 ;). 

It remains to generalize these results to the case where the HT{k,t} are N xT matrices. 
The closedness of Ht and the existence of the self-adjoint operator HtH^ are direct general- 
izations of the scalar case. Writing k) = [HT{k, k -|- the shift B defined by 

the equation Ht{Buj, k) = HT{u},k + 1) is ergodic. Since the operator HtH^ satisfies the 

goes along nearly without modification. Let us just check that the Stieltjes transform of 
gT is A^"^TrEQ(0,0)(2;). Writing g{HTH^)p^q{k,£) = {ckN+p, g{HTHf,)eiN+q), we get by 
mimicking the derivation above that 


identity {HtH^){Bu]) = {HTH^){io)U it is also ergodic. The proof of Proposition 


t-n 


4.1 


I 

{2n + l)N 


N—l n 

^ ^ g{HTH*^)rAi,i) 

r=0 i=—n 


lTrE<7(77T^^)(0,0). 

n^oo iV 


Taking g{X) = {X — z) we get the result. 


Proof of Theorem 12.11 

As is well known (see e.g. 131 ]), 


I(S; (Y,H)) =limsup^‘^—77’") + 1(5”; I 77”)) 
n 2n + l\ / 

= limsup-^7(5”;y-|77"") 

n 2 tI + 1 

= limsup ^ Elogdet(i7"’j7"* -7 7(2n+i)jv) 

n ^Tl “T 1 
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by the independence of S"^ and H^. Let be the NCM of the random matrix . Then 

^2nll)N logdet(F-F-* + I) = j log(l + A) vJ}{dX). 

The measure satisfies 


Ai/^(dA) = 


Tr(iL"iL”*) 

(2n + l)iV 

^ n L 

(2n + l)N ^ 

A ^ m=-ni=-L 

^ n L 


m — 


< 


,2n + l\N ^ E(Tr^(^)vl(£r + TrX(m,m-^)X(m,m-£)*) 

V ' ' rry, — n _ T 


m=—n £=—L 


N 


TTA{i)A{iy 


i=-L 


+ 2 ^ TvW{m,m-£)W{m,m-ir 

£=-L ^ “T i ' m=-n 


( 6 ) 


Since the process {W{m,m — i)W{m,m — i)*}mez is ergodic, it holds that 

1 


2 n + 


-V W{m,m-i)W{m,m-i)* ^^TIn- 

1 n^oo 


We therefore get that 

r ^ 

limsup / A^'^((iA) < 2( ^ \\At{(.)\\^ + 0 ''^) w.p. 1. 


i=-L 


Let us consider an elementary event in the probability one set where converges weakly 
to ht by Prop. 4.1 and where sup„ J" XduJj^ < oo. By uniform integrability, we get that 
f log(l + X)vJ^{dX) —)• f log(l + X)^T{dX) < oo on the elementary event we just considered. 
Consequently, /log(l + X)i'^{dX) —)• Jlog(l + X)fiT{dX) < oo almost surely. 

Getting back to the inequality Q, it is clear that sup„E f XvJy{dX) < 2('y2^ ||j47’(t')||^ 
Therefore, sup„E(J log(l + X)vj,{dX)y < oo which shows that 


'2 + 4) 


E 


log(l + X)v£^{dX) -> 

n^oo 


y log(l + X)iJ.T{dX) < oo 


by uniform integrability. 


5 


Proof of Theorem 


2.2 


We start by showing the uniqueness of the solution z),(prp(^-^ z)) satisfying the inte¬ 

grability and the non-negativity conditions. We then provide a constructive existence proof, 
and prove in passing that the constructed solution satisfies the Stieltjes transform properties 
provided in the statement. 
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5.1 The uniqueness 

The general idea of the uniqueness proof can be found in e.g. [I2j or [19]. Applying this idea 
to the case of this article requires some specihc work. We fix z G C+ and we re-denote herein 
S{f,z), etc. as ip{f), S{f), etc. for simplicity. 

We start with some lemmas. Assume that (pj’{f)) is a solution. Then 

Lemma 5.1. The solution (¥^t(/))‘^t(/)) satisfies the equation 


where 



- C ' 

^{zp{f)) 

L 


KfPU.n) 






/G [0,1], 


^ip{u) 

'^{zp{u)) 


du 








K^fU^u) 


TTS{f){AA*){f)S{fr 

T\l + CiW 

TrS{f){A*A){f)S{fr ^, 

T\i + ~aw 


/), 


u), 


Fr‘^{f) 


Ft'^if) 


TvS{f)S{fr 

^ T ’ ^ 

TvS{f){A*A){f)S{fr 

T\l + C{fW 


Cif) = and C(/) = c^^7(/) * 


Proof. We have 


^Pif) = 


2i 

1 


= -F-Ti-S(S-* - s-^)s* 

2iT ^ ^ 

= ^ Ti’ -S' ({z - z)I + o-^7(/) * {zp{f) - z(p{f))I 


+ 


iAA*){f) iAA*){f) 

1 + C(/) 1 + C(/) 


TrS(/)S*(/) 


T 


+ (^C(/)) 


TTS{f){AA*){f)S*{f) 

r|iTc(/)P'^‘^^ ' 
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By a similar derivation, we also have 


*-z ^)S* 


= (o'^7(-/)*9'¥^(/)) 


zpTrS(/)5 (/) , ^ TrS{f){A*A){f)S (/) 


+ ^z 




- T\i+aw 

TTS{f){A*A){f)S*{f) 

r|i + C(/)P 


hence the result. □ 

Lemma 5.2. For any f G [0,1], > 0, F^’^{f) > 0, 0 < Q{z^{f)) < C, and 

> C'Tr(A*A)(/) where C and C are positive numbers that do not depend on f. 

Proof. Let Aq, • • • , Xn-i be the eigenvalues of (AA*)(/). We start by establishing an upper 
bound on | — z{l + C) + -^i/(l + C)l- Observing that m.axf^{f) < ^ | 7 t(-^)| < 9 and writing 
b{z) = Jq \ip{f, z)\df < oo, we have |z(l + C)| < \z\il + cr‘^gb{z)). Moreover, |1 + Cl = 
|z + zCl/kl > since > 0 by assumption, and 7 (/) > 0 . Therefore, |Aj/(l + C)| < 

and |—2;(1 + C) + Ai/(1 + C)| < \z\{l + (T‘^gb{z) + a‘^/’isz). The inequality >0 

follows readily. Moreover, 




l--(i + 0 + i^P |-^(1 + C) + T^P 


( 7 ) 


The inequalities on 9(z<^(/)) and are proven by similar means. □ 

Lemma 5.3. The functions (p and ip are continuous on [0,1]. 

Proof. Since (p is integrable and 7 is continuous, C is continuous by the dominated convergence 
theorem, and similarly for C- Moreover, (AA*) is continuous as a trigonometric polynomial. 
It follows that the matrix functions S and S are continuous, hence the result. □ 


To establish the uniqueness of the solution, we observe that by setting 




^P{f) 




and 




^{zcpif)) 


^{zg:>{f)) 


on [0,1]^, Lemmas 5.1 and 5.2 show that 


0 < / Jf’^(/, u,z)du<l- Tlz < 1 

Jo ^P[f,z) 


and 


0 < 4’^{f,u,z)du < 1 - < 1 - CTV(A*A)(/) 

Jo '^{Zip[J, Z)) 


( 8 ) 

(9) 
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where C > 0. Assume now that and are two solutions. Denote respectively 

by {S, S) and {S', S ) the matrix functions associated to these solutions as in the statement 
of the proposition. Write A(^(/) = <^(/) — <p'{f) and zAif{f) = z(f{f) — zip'{j). Mimicking 
the proof of Lemma |5.1t we get 

A<^(/) = ^Tr5(5'-i-5-1)5' 

(/Cii(/, u)Aif{u) + JCuif, u)zAip{u)) du 

and 

zA^if) = ^TrS{s'~'-S~^)S 

{K^ 2 i{f, u)A(p{u) + /C 22 (/, u)zAg:>{u)) du 




where 


JCuif,u) 
ICuif,u) 
IC2iif,u) 
JC22{f,u) 


Tr5(/)(AA-)(/)5'(/) _ 

r(l + cr‘^nf{-f) * ‘^(/))(1 + cr^7(-/) * <^'(/)) 

Tr5(/)(A-A)(/)5(/) , _ 

r(l + u 27 (/) * <^(/))(l + a 27 (/) * <^'(/)) 


( 10 ) 


Let 


£(/) = 


A(p(/) 


VWUiWU) 


and e(/) = 


zA^if) 


V^iz^UiWWU)) 


Using the inequality | Tr(MiM 2 )| < Y^Tr-(MiMj^) Y^Tr(M 2 M|) for any two matrices Mi and 
M 2 with compatible dimensions, we get 


e{f)\ < 


if, u)'Aip{u) 

Jo ^ 



e{u)\ 


+ 


u)^{zif{u)) if, u)'A{zip'{u)) 

TWy VWU) 



du 


Let e = supj-g[o,i](k(/)l V |e(/)|)- Using the inequality ^/ax: + y/bd < y/a + by/c + d where 
a,b,c, d > 0 along with the Cauchy-Schwarz inequality, we get 


k(/)| - [J^ «/i’’‘^(/,«)'^w 


1/2 


Ji if, u) du 


1/2 


We shall assume that e > 0 and obtain a contradiction. By the last inequality and ([^, 
we have |e(/)| < e. Lemmas 5.2 and 5.3 show that s{f) is continuous on [0,1], hence 
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supjg[o,i] k(/)l < Turning to s{f), we get by a similar argument that 


m < f 

Jo 


K 2 {'^U,u)'Jsip{u) {f,u)^(p'{u) 




e{u)\ 


+ 


V^uwuy) 




V^izHf)) 


V^iWU)) 


e(n)| ) du 


If {A*A){f) = 0, then = 0 for any u G [0,1]. But then, Inequal¬ 

ities Q show that |e(/)| < sup„g[o,i] k('*^)l < On the other hand, if Tr(A*A)(/) > 0, 
then 

|e(/)| < ^ {f,u)duj ^ e<e 

byj^again. Therefore, we also have supjgjQ^^] |e(/)| < e by the continuity of e(/), which leads 
to a contradiction. Uniqueness is established. 


5.2 The existence 

Starting with the functions (p^^'>{f,z) = —{N/T)z~^ and ip^^\f,z) = —z~^ on [0,1] x C. 
define the recursion 




with 


(/, ^)) = (t-^ Tr 5(0 if, z), T-i TV (/, z )), 


5(0(/,z) = 


S if,z) = 


-zil + C^'‘\f,z))I+ 


-zil + C^'^\f,z))I + 


i + &Hf,z) 
iA*A)if) 
l + C^'\f,z)} 


1 -1 


C(0(/,z) = ct 27 (-/)*(^( 0 (/,^), and C^'‘\f,z) = a^'rif)*¥>^'^\f,z). 

We shall show that the sequence i^^^\f, z), z)) converges on [0,1] x C+, and that the 


limit (v^(/, z), (fif, z)) is the solution of the system described in the statement of Theorem 2.2 
Lemma 5.4. The following facts hold true: 

i) For any fc G N and any f G [0,1], the functions (p^^\f,-) and T>^^\f,-) are holomor- 
phic on C+ and satisfy \(f^'^\f,z)\ < c/Qz and \(f^^\f,z)\ < l/SVz. Furthermore, 
’~sif^^\f,z), ’Js{zif^^\f, z)), ’^(f^’^\f,z), and ^izip^^\f, z)) are all positive /or 2 gC+. 


a) In the region 


7^ = 1^; G C+ : (cVl)cr^ 


l^l 


)<^}. 
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it holds that 




(^=+1) (/, z) - (/, z) I V I<^(^+1) if, z) - (/, z) I 




in) For any f, f G [0,1] and any z G C+, 


(cV l)|z| 


((o-^ + hif - f + u)- 7(^)1 du + AirLa'^lf - /'|) . 


Proof. The analyticity as well as the inequalities stated in ij are trivially true for and 
for Assume they are for and (p^^\ Denoting again by Aq, • • ■, Xn-i the eigenvalues 
of {AA*){f), we have 

I - z(l + + Ai/(1 + + Qiz'C^^^) + Ai^C^^Vll + (11) 

Consequently, and are holomorphic on z G C+ for any / G [0,1] and they 

satisfy , z)\ < c/^z and , z)\ < l/Sz. By reproducing the inequalities Q, 

we also show that > 0. The other inequalities are proven similarly, which establishes 

Item i). 


We now show iij. Writing z) = z)—ip^’^ (/, z) and ( /, z) = (p^’^\f,z) — 

{p{k-p^fi we get by a derivation similar to the one made in Section 5.1 that 

f i)<^n\f,u,z)A^^'^{u,z) + zJC[^^{f,u,z)A^{u,z))du 


A^+^(/,z)= / {z ^/C^i^(/,u,2)A(^)(m,z)+/ c),2^(/,u,z)A(*=)(u,2;))du 


where the 1C)., 


(k) 


have the same expressions as the /C* 


ij in (10) except that the S,... there 
are replaced with ... and the S',... are replaced with .... Using the bounds 

established in i), we readily obtain 


\^n(.f,u,z)\ 

\zlC^i2{f,u,z)\ 

z~^IC2i\f,u,z)\ 


< ca‘^a^\z\^j(u — f)/(Csz)"^, 

< CCT^\z\j{f -u)/{Cszf, 

< a‘^\z\j{u- f)/{'^z)^, 

< cr‘^a^\z\‘^j{f — u)/{Csz)‘^. 


Item a) follows from these inequalities after integrating over u. 

To show Hi), we start by writing = T~^ Tr 

where we omit the parameter z. Developing the right hand side, we get 


IC^^V) - C^^'VOI = / hif-u)-u))g:>^’'\u)du 


JO 

hif - f+ u) -'yiu)\du 

Jq 
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and similarly for |C^^^(/) ~ C^*^^(/0I- Furthermore, 


{AA*){f) {AA*){f) 

i + ^W(/) i + c('=)(/') 

iAA*){f) - {AA*){f) - C^’^\f))iAA*){f) 

l + (l + C^") (/))(!+ C^")(/')) 


Recalling that A(f) is a trigonometric matrix polynomial, it can be checked that ||(AA*)(/) — 
iAA*){f)\\ < 47rLa^|/ — f'\. Item Hi) is then obtained by a small calculation. □ 


2 defined before the statement of Lemma 5.4 


IS a map on 


For any z G C+, the map /i, 
the Banach space of the continuous [0,1] —>■ functions endowed with the maximum of the 

supremum norms of the two components. Showing the existence of the solution amounts to 
showing the existence of a fixed point of hz that satisfies the inequalities provided in the state¬ 
ment of Theorem |2.2[ By Lemma 5A-ii), the map hz is a contraction for any z £TZ. Therefore, 
, z), , z)) converges for any z £ TZ, and the limit that we denote z), (p{-, z)) 

is a fixed point of hz- 


We now show that from every sequence of integers, one can extract a subsequence v{k) such 
that and z) converge uniformly on the compact subsets of [0,1] x C+. 

To that end, we start by showing that on every compact set K C C+, the families z) 

and ip^^\f,z) are equicontinuous on [0,1] x K. Let 2d > 0 be the distance from K to M, 
and let z,z' G K be such that \z — z'\ < d/2. Denote by C the positively oriented circle with 
center z and radius d. Since (p^^\f,z) is holomorphic, we have by Cauchy’s formula 




(^\f,z)-<f^^\f,z>) = 


z — z 




w 


which shows that 


Jc {w — z){w — z') 




dw 


( 12 ) 


Given two couples (/, z), (/', z') G [0,1] x K, we have z) — z')\ < z ) — 


, z)\ + |(^(^)(/', z) — , z')\. In conjunction with Inequality 


Lemma 


5.4 


in) 


shows then that the families 2:)}fceN and {<^^^^(/, ^)}A:eN are equicontinuous on the 

compact set [0,1] x iL (note that the integral at the right hand side of the inequality in 
Lemma 5.4-mj converges to zero as / — /'—)• 0 by the dominated convergence theorem). 
These families are moreover bounded on [0,1] x K. Therefore, by the Arzela-Ascoli theorem, 
one can extract from every sequence of integers a subsequence p{k) such that z) and 

^ z) converge uniformly on [0,1] x K. Considering a sequence of compact subsets of 
C+ who is increasing with respect to the inclusion and whose union is C+, one can establish 
the existence of the sequence v{k) by the diagonal process. 


Note that the respective limit functions v^^(/, z) and (^^(/, z) of z) and z) 


■yiv{k)) 

are continuous in the variable / and holomorphic in the variable 2 . They also satisfy 
|‘^„(/,^;)| < c/'Az, \(p^{f,z)\ < l/’isz, and ’^{zcp^), and 9=(z(^^) are nonnega¬ 


tive by Lemma 5.4dJ and a passage to the limit. Furthermore, hz{ipy{f,z),ip^{f,z)) exists 

ip{f,z)). Thus{(p^{f,z),(p^{f,z))- 


and is holomorphic on C+. 

For z £ TZ, z), <fy{f, z)) clearly coincides with (<^(/, ^ 
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hz{(Pv(f, z), ^^{f, z)) = 0 on TZ, hence on all C+ by analyticity. 


We just showed that converges to the unique solution {(p, ip) of the system in 

the statement of Theorem |2.2[ The functions ip and ip are analytical in the variable z and 
they satisfy \ip{f^z)\ < c/^z and \ip{f,z)\ < l/9z along with the non-negativity conditions. 
Hence, for any / G [0,1], {T/N)(p{f, z) and (p{f,z) are the Stieltjes transforms of finite 
positive measures carried by [0,oo). By Lemma 5.2, Qip, ^ip, ii(zip), and are positive. 

It remains to show that these measures are probability measures. We have 

1 ^ 


^ ^ ^ * Hf, w)) + 


A,, 


For any i, the denominator behaves as —ly when y —)• oo. Hence —{T/N)iyip{f,iy) —)■ 1 as 
y —7- oo, which establishes the result for {T/N)(p. The result for ip is proven similarly. 


6 Proofs of Theorems 12.31 and 12.4 


We start with some notations. Recall that Qt{z) = {HtH^ — zl). Denote by Q{z) = 
— zl)~^ the resolvent of the self-adjoint operator H^Ht- The complex number z G C+ 
will be always written as z = x + ly. When there is no ambiguity, the parameter z or the 
index T will be omitted for notational simplicity. We shall denote as Bm,n{k, i) or [B]m,n{k, i) 
the (m, n) element of the {k, 1) block of the matrix representation of B, the size of the blocks 
being clear from the context. The block itself will be denoted as B{k,i) or [B](k,i). 


6.1 Identities, inequalities and basic tools 

We start with a some technical results. 

Lemma 6.1. The operator Q{z)H can he eontinuously extended to a bounded operator on 
Z^(Z) (since the domain of H is only dense in (^{Tj)) with adjoint H*Q{z). The norm of this 
operator satisfies ||(5(z)ll|p < (y + |z|)/y^. 

Proof. Given any a G T){H) and any b G we have {Q{z)Ha,b) = {Ha,Q{z)b) = 

{a, H*Q{z)b), the second inequality being due to the fact that Q{z)b G V{HH*) C V{H*). 
Therefore, H*Q{z) is the adjoint of Q{z)H, and since it is defined on all /^(Z), the operator 
Q{z)H can be extended to a bounded operator. We furthermore have 

\\Q{z)HH*Q{m = ||Q(^)(/ + ^Q(^))|| < 

yZ 

hence the last result. □ 

The following bounds will also be often used in the proof without mention: 

\Qn,n'ik,i){z)\ < \\Q{k,i){z)\\ < ||Q(z)|| < -, 

y 

N-l 

E E \QnAk,^)\^ = [Q*Q]pA^,0 < -2 

fceZ n=0 ^ 

N-l I I 

^ \[QHUAm,e)\^ = [QHH*Q\,p{m,m) < 
rez p=o ^ 
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Lemma 6.2. H*Q{z)H = I + zQ{z) and HQH* = I + zQ{z) by continuous extensions of 
the left hand members. 

Proof. The proof is based on the polar decomposition H = U\H\ of H |23l §VI.2.7]. We 
recall that \H\ = {H*, that U is a partial isometry from TZ(\H\) to 77(i7) (here TZ{-) is 
the range of an operator and TZ{-) the closure of this range), and that H* = \H\U*. Letting 
a G PiH) and b = Q{z)Ha, we have Ha = HH*b — zb, or equivalently b = z~^H{H*b — a) 
that we rewrite as 6 = Uh with h = z~^\H\{\H\U*b — a). Since h G Tl{\H\), U*Uh = h, 
therefore h = z~^\H\{\H\h — a) or equivalently h = (|77p — zI)~^\H\a = |77|(|7rp — zl)~^a. 
Finally, the vector w = HfQ{z)Ha^= H*b satisfies w = \H\U*Uh = |-f7p(|i7p — zl)~^a = 
H* HQ{z)a = {H*H — z)Q{z)a + zQ{z)a = a + zQ{z)a. □ 

The following lemma can be shown by direct calculation. 

Lemma 6.3 (Differentiation formulas). Given an integer M G N, let be the random 
operator represented by the matrix H{k, i)] and letQ^{z) = — 

zl)~^ = [[Q^g{k,(l){z)]p~2(fk,e&z be the resolvent of the self adjoint operator . For 

any integers i,j,k,l, n, t,p, q satisfying —M < i,j, k,i < M, \i — j\ < L, 0 < n,t,p < N — 1, 
and 0 < q < T — 1, it holds that 

dQ^,t{ka) 

dXp,q(i, j) 
dXp,q{i, j) 

The following basic property of convolution operators is well known [?]• 

Proposition 6.1. A convolution operator B = [Bk-i]k,ee'z where the blocks are N xN matri¬ 
ces is a bounded operator on /^(Z) if and only if there exists a bounded N x N matrix function 
Bif) on [0,1] such that 

Bk= [ exp(-2z7r/c/)B(/) df k gZ, 

Jo 

i.e., the are the Fourier coefficients of the 1-periodie function equal to B{f) on [0,1]. 

Let us identify the function B{ f) on [0,1] with the multiplication operator B : T^([0,1] —)• 
C^) —)• T^([0,1] —)• C^) who sends g{f) to {Bg){f) = B{f)g{f). Clearly, the spectral norm 
of this operator coincides with supjgjQ ||S(/)||. Let 

Hn : T^([0,1] C^) /^(Z), T)v(gf) = ( [ exp{-2nrkf)g{f) df) = {gk)k&z 

\Jq /k£Z 

be the isometric operator that sends g to the sequence of its Fourier coefficients. Then it 
holds that B = F^BF^q. 


= -Q^,p{k,i)[H^*Q%,t{j4) and 

= -[Q^H^]n,q{k,j)Qf^S.^)- 


We now introduce the two basic tools used in the proof of Theorem 2.3 
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Proposition 6.2. Let ^ G be a eomplex Gaussian centered vector such that = 0 

and = E. Let F : C'^ —>■ M 6e a function polynomially hounded together with its 
derivatives (when seeing F as a function from to Mj. Then 

M 

E[?ir(0] = 

m=l 

This proposition can be proven by an integration by parts. We shall call it for this reason 
the IP formula. 

Proposition 6.3. Letf^ G andV : —)> M 6e as in the statement of Proposition \6^ and 

letVzT = [dT/dzi,... ,dT/S zmV' andVzT = [(9F/5Tl, ■ • •, dV/B zmV'■ Then the following 
inequality holds true: 

VarF(0<E[V,F(ef HV;f(^+E[(VjF(e)r H V^F(e)] . 

This inequality is known as the Poincare-Nash (PN) inequality. For a proof, see |32[ 
Prop. 2.1.6]. 


dm 

. dim _ 


6.2 System of equations controlling EQ(fc, k + m) and EQ(fc, k + m). 

We now enter the core of the proof, which consists in showing that the EQ(A:,t) and the 
E(5(A:,f) satisfy a perturbed infinite system of equations. Our derivations will be greatly 
simplified by the fact that due to the ergodicity of Ht, the operators KQ{z) and KQ{z) are 
bounded convolution operators for any z G C+. 

We start with some variance controls. The following lemma is proven in Appendix]^ The 
proof is based on the use of Proposition 6.3 along with an operator truncation argument. 

Lemma 6.4. For any k,£ G X and any n G {0,..., A" — 1} and t G {0, ..., T — 1}, we have 




and Yav[H*Q]p^q{k,£) < 


2 cr^gi\z\ +y)" 


Our purpose is now to find an equation satisfied by KQp^g{k, i). To that end, we start by 
writing E[[HH*Q]p^q{k,£)] = Xi + X2 + Xs + X4 where xi, X2 and xs are given by the finite 
sums 


Ar-1,T-1 

Xl = X] H^P,tik,i)mijY)QnqUJ)]^ 

i,j n,t=0 

X2 = ^ ^ E[Xp^t{k,i)Ang{j,i)Qnqij,e)], 

i,j n,t=0 

X3 = ^ HAp^tik,i)Xn,tiLi)Qnq{jJ)], 

i,j n,t=0 
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and where 


XA = n[AA*Q]p,q{k,^)]. 

Starting with xij the idea is to develop "&[Xp^t{k.,i)Xn,t{j-,i)Qnq{j-,(-)\ with the help of the IP 
formula. Thanks to this formula, we expect that 


E[Xp,i(fc, i)Xn,tij, i)Qnq{j, ^)] 

= ^E[Xp^t{k,i)Xp^t{u,v)]E 


d{Xn,t{j,i)Qnq{jJ)) 

dXp^t{u,v) 


^ 7t(/c - 

/ ^ rjp ^k—i,u—v^Op^n^u,j^v^i^[Qnq\Jj ^)\ 

u,v 


E 


lTir)4>T{k - if 
T 


^^p,n^k,j^r,oE[Qnq(^jj ■^)] 

E[Xn,tijj i) \.QH\n,t{,j^ ^ )Qp,q{k T , 


(13) 


Note that the vector ^ in the statement of Proposition | 6 . 2| is finite dimensional. Therefore, 
Equation (13) needs to be justified. Given an integer M G N, let and be as in the 
statement of Lemma 6.3 When M > |A:| V |i| V \ j\, we get by applying Proposition 6.2 and 
repeating the derivation above that 

E[Xp,t{k,i)XnAjA)Qnqij,^)] 

kAi+M 


E ^ ^{6p,nSk,jdr,oE[QAAj, ^)] 

- E[Xn,tUA)[Q^H^]nAjA- r)Q^A^ - rA)]]. (14) 


r=kVi—M 


The argument provided in the proof of Lemma 6.4 shows that QnqAi A Qnqij, i) as M ^ 
oo. Since \XpAk,i)AnAjA)QnqijA)\ < \^pA^A)^nAjA)\/y, it holds by the dominated 
convergence theorem that E[Xp^tik,i)Xn,tij,i)QnqUA)] HXpAk,i)XnAjA)QnqijA)] as 
M ^ oo. Using Lemma |6.1| and the summability of 7 , we can show by a similar argument 


that that the right hand side of (14) converges to the right hand side of (13). Equation (13) 
is shown. 

Turning to X 2 , we obtain by a similar derivation 

X2 = -E E — —E[AnAjA)[QH]nAjA-r)Qp,qi^-rA)]- 

i.,j,r n,t=0 
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Considering xi + X 2 , and taking the sum over j and n, we get 

T-l . ^ , ,, ,.2 


- r)QpJk - r,£)]j 


i,r t=0 


T-l 

EE 

r,s t=0 


7Tir)4>T{sf 

T 


E 


Qp^q r, t?) X 


- s,k - s - r) - [H*QH]t^tik - s,k - s - r)) 
Tr Q{k — s, k — s — r) 


-zJ2lT{r)(j)T{sfE Qpq{k-r,l)- 


T 


where the third equality is due to Lemma 6.2 In the second equation, we considered that 
<5r,o = h,t{k — s,k — s — r), the (t, t) element of the {k — s,k — s — r) block of the ‘W x N 
block-matrix representation” of the identity operator. Turning to X3j 'ws have 


X3 = EE Ap^tik, i)E[Xn,t{u, v)Xn,tij, *)]IE 

ij,u,v n,t 

.■,Xt{u - j)(t)T{j - if 


Ap^tjk, 

i,j,u,v n,t 


T 


dQn,q{j,(!) 

dXn,t{u,v). 

Sj.i,u-vE[Qn,nU^u)[H*Q]t,q{^J)] 


■ ^Tir)(j)T{sfAp^t{k, i)E 

2 ,r,s t 


TtQ{s + i,s + i + r) 


[H*Q]t,q{i + r,£) 


We now “decouple” the terms within the expectations in the expressions of xi + X 2 and xs 
by using the inequality |EXy - EXEy| = |E[(X - EX)(y - Ey)]| < (Var X)i/2(Var y)^/^ 
along with Lemma [6^ By the ergodicity of H, terms such as E[Qp,q(A:, £)], E[[H*Q]p^q{k,£)], 
or K[[AH*Q]p^q(k, £)] depend onk — £ only. With a small notation abuse, we shall henceforth 
denote the first of these terms as E[Qp^q{k,£)\ or E[Qp^q{k — £)] interchangeably, and similarly 
for the other terms. With these notations, we get 


Tr EQ(r) 


where 


Xi + X 2 = -za^ XT{r)E[Qpq{k - r - £)] 


\e (k - £)\ < 2V^^Vkl(kl+l/) 




and 


X3 = -(TlY^'^XT{r)Ap^t{k,i) 

i^r t 


TrEg(-r) 


E[[H*Q]t,qii,£-r)] +e'^^^{k-£) 


where e' satisfies by Lemma 6.4 




T 
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since \Ap^t{k^i)\ < VTYIz II^(^)0II ^ VTa. Recalling that ^ is a convolution operator, 
we have 


X3 = 


2 


'^lT{r) 


TrEg(-r) 


¥.[[AH*Q]p^q{k + r-t)\ +e'p^q{k-e). 


By the identity HH*Q{z) = {HH* — zI)Q{z) + zQ = I + zQ{z), we get 

mpq{k -i) = -z-^I^^gik -i) + z-\xi + X2) + z-^E[AH*Q]p,g{k - i), 
E[AH*Q]p,g{k-£) = X3 + X4. 


Specifically, 


EQp^g{k) = -z ^Ip^gik) - a^'^xrir) EQpg{k - r) 


+ z-^E[AH*Q]p,g{k)+£p^g{k), 

E[AH*Q]p,gik) = -4 xH-r) E[AH*Q]p,gik - r) 

r 

+ E[AA* Q]p^g(k) + £p^g{k). 

Similar derivations lead to the identities 


EQp,gik) = -z ^Ip^g{k) - (jlJ2^T{-r) 

r 

-j-z ^E[A*HQ]p^g{k) + £p^g{k), 


TrEQ(r) 


EQp^g{k - r) 


E[A*HQ]p,gik) = -a^Y.^T{r) 


TrEQ(r) 


E[A*HQ]p,g{k-r) 


+ E[A* AQ]p^g{k) + £p^q{k) 


where 




6.3 Passage to the frequency domain; End of the proof 

For any 2 ; G C+, we can identify the matrix functions St{-,z) and defined in the 

statement of Theorem 2.2 with the multiplication operators z)g){f) = Sxif, z)g{f) 

and (ST{-,z)g){f) = ST{f,z)g{f) on £^([0,1] C^) and £^([0,1] C^) respectively. 

Given a function g G £^([0,1] —)• C'^) such that ||gi|| > 0, one can show by derivations similar 
to those of Section [5] that 


{{^S{-,z))g,g) 


9{frS{f){S{f)-* - S{f)-^)S{f)*g{f) 

Jo 


for any 2 ; G C+, where Q'S' = {S — S*)/{2i). The self-adjoint operator QS{-,z) is therefore 
positive for any 2 ; G C+. Similarly, one can show that ^{zS{-, z)) is positive on C+, and so is 
the case for QS{-,z) and 3'(2:S'(-, 2 ;)). 
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Let St{z) = [Srik — i){z)]k/^'L and St{z) = [Srik — tj{z)\k,t^z the convolution operators 
obtained through the isometries 

St{z) = T]\[St{-, z)Tj^ and Sriz) = TtSt{-, z)Tj<. 

Then we have the following result: 

Lemma 6.5. For any z G C+, the matrix blocks of the operators St and St satisfy the 
equations (where the parameter z is omitted) 

ST{k) = -z~^5k,olN - o-T -iT{r)(pT{r)ST{k - r) + z~^PT{k), 

r 

PT{k) = -(Tr^7r(-r)v9T(r)-Pr(fc - r) + [^r^y5r](A;), 

r 

ST{k) = -z~^6kfilT -iT{-r)(pT{r)ST{k - r) + z~^PT{k), 

r 

PT{k) = -aT'^'yT{r)'PT{r)PT{k - r) + [A!fATST]ik) 


where 


ipT{r) 


Tr5r(r) 

f 


and (pt{'<') 


Tr 5T(r) 

f 


The convolution operators Pt = [PT{k—i)]k/^i and Pt = [PT{k—i)]k,eei, satisfy the inequality 
||P( 2 ;)|| V ||-P(2:)|| < a?\z\/{^z)‘^. In addition, ||S'r|| V IIS’!! < and furthermore, “ASt, 

^St, ^{zSt) and ^{zSt) are positive. 

There exist four unique bounded convolution operators St = [STik — I)]k,e&z, St = [*S''r(fc — 
i)]k,e£Z, Pt = [Prik - i)]k,e£Z and Pt = [PT{k - I)]k,e£Z satisfying the equations above in 
association with the positivity constraints on ‘A St, IsSt, 1S{zSt) and Q{zSt). 


Proof. Equation Q can be rewritten as 


-zS{f) - za^ijif) * tpif))Sif) + P{f) = I, 

(1 + aS(-/) * ^{f))P{f) = {AA*){f)S{f). 


Taking the Fourier transforms of S{f) and P{f), we recover the first two equations in the 
statement. The other two equations are obtained similarly. We know that the multiplication 
operator S satisfies ||S^|| < 95^ > 0 and Q{zS) > 0. From the expression of P{f) we 

see that the associated multiplication operator has its norm bounded by a?\z\/{^zY. Since 
Pm are Pt are isometries, the norms and positivity constraints are deduced at once. 

Finally, since a bounded convolution operator is uniquely determined by its associated multi¬ 
plication operator, and since the operators S and S are uniquely determined by the conditions 
of Theorem 2.2 we obtain the uniqueness alluded to in the last part of the statement. □ 


We are now in position to establish the hrst statement of Theorem |2.3[ Since 
Tr5T(0)(z) 


N 




(frif, z)df, 
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what we need to show is that N ^ Tr(EQT(0)(2:) — 5 t( 0)(2;)) —)■ 0 for any 2 : G C_|_. To that 
end, we start by showing that 


^ ||EQ(fe)-5(fe)||fro ^ ||EQ(fe)-5(fe)||fro 

At = sup- -= - and At = sup- -= - 

fcez \/T fcez vT 


converge to zero as T —)• 00 in a certain region of C+, where || • ||fi-o is the Frobenius norm. 
Indeed, we have 


EQ{k) - S{k) = -4 J] 7 T(r)^^^^^(EQ(fe - r) - S{k - r)) 

r 

2 ^ , ,TrEQ(r)-TV5(r)^,, , 

- ^T2^7T{r) - - - S{k - r) 

r 

+ z-\E[AH*Q]{k) - Pik)) + E{k), 

E[AH*Q]{k) - P{k) = -4 ^ 7 T(-r) {E[AH*Q](k - r) - Pik - r)) 

r 

+ [AA*iEQ- S)]ik) + E\k) 

where ||£'(/c) ||fro = kp,g(^)P) < C{z)IVT and ||£''(A:)||fi.o < C\z), with Ciz) and 

C'iz) being bounded on the compact subsets of C+. Notice that 


Tr[EQ- S](r) 


T 


< 


EJ[EQ-%p(r)| 


T 


^ (iVEp|[EQ-%p(r)p)V^ ^ ||[EQ-g](r)||fro 

T VT ^ ’ 

and similarly, | Tr[E(5 — S'](r)/T| < ||E(5 — 5](r)||fro/\/T'- We also have \\S{k)\\‘f^^/T < 
c\\S{k)f < cjy^ and \\Pik)\\l^^/T < c||P(A:)|p < ca^\z\^/y^. 

Writing 

\\E[AH*Q]{k) - Pik)U, 


A't = sup 
k 


Vt 


we easily get from the expression of EQik) — S{k) above 

Ar < Ts At + 

y y \A T 

Using the norm inequality ||C/U||fro < ||U|| l4l|fro where U and V are two matrices, we observe 
that 


||[GiGi*(Eg - s)]ik)Uo = -s]ij-i + fe)||fro 

hi 

hi 
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and we get from the expression of ¥\AH*Q]{k) — P{k) that 


+ !:!£'^At + a^Ar + 

y \/r 

and we have two similar inequalities for At and = sup;. \\K[A*HQ]{k) — P(/c)||fro/'v/T. 
These four inequalities can be written in a compact form as 


At{z) < M{z)At{z) + 


where At{z) = [At’, A^, A-p, the inequality is element wise, M{z) is a 4 x 4 matrix 

independent on T whose norm converges to zero as y —>■ oo, and E{z) is vector whose norm 
is bounded on any compact set of C+. 


This result coupled with the inequalities (15) shows that for y large enough, it holds that 
A-i Tr(EQT(0)(z) - St{0){z)) 0. Recall that N-^ TrEgT(0)(^) and TV5t(0)(z) are 

holomorphic and bounded on the compact subsets of C — M+. By the normal family theorem, 
from any sequence of integers depending on T, we can extract a subsequence v{T) such that 
N{v{T))~^ Tr(EQ^(2^)(0)(z) — 5„('r)(0)(2;)) —)• 0 on the compact subsets of C — IR+. But since 
the limit is zero for large enough y, it is identically zero on C — M+. The convergence Q in 
the statement of Theorem 2.3 is proven. To establish the second part of the statement, we 
need the tightness of ttt- 


Lemma 6.6. The sequence tvt is tight. 


Proof. For any small e > 0, any T G N, any / G [0,1] and any eigenvalue of 

{ATA*rp){f), writing CtUav) = cttItU) * ^tUA h) and CxifAy) = f) T>T{f Ay), 

we have 


-ly 


-ly - lyCrif, w) + 


- 1 


w) 


CrifAy) - 

1 + Ct(/) ^2/) 


_ ^i,T{f) 

w+wCT(f,'^y) 


< e 


when y > Ce = 2{cr'^g + a'^)fe, thanks to the inequalities |Ct(/) w)\ < and ^{^yCTif, w)) > 

0 as y > 0. This implies that the Stieltjes transform Pt{z) = N~^ Tv StH, z) df of tvt 

satisfies \iyPTi~w) + 1| < e when y > C^. Since the family (mr(-z))TeN is bounded on the 
compact subsets of C+, from any sequence of integers, one can extract a subsequence v{T) 
such that Py(T){z) converges to a function p'"{z) uniformly on the compacts of C+. The func¬ 
tion p^{z) is moreover the Stieltjes transform of a positive measure p,^. By a passage to the 
limit, it holds that \iyp'"(—ly) + 1| < e when y > C^. This implies that —iyp'^{—iy) —?• 1 as 
y —)• oo, in other words, is a probability measure. We have shown that the sequence of 
probability measures Pt is weakly compact. Therefore it is tight. □ 

From the tightness of the sequence ttt and the convergence Q , we get that any converging 
subsequence of Stieltjes transforms of the yr has the Stieltjes transform of a probability 
measure as a limit. Hence the tightness of the px and the second part of the statement of 
Theorem 12.31 
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6.4 Sketch of the proof of Theorem |2.4 

We remarked above that Equations ([^-Q are similar to their counterparts in |18| Th. 2.4] 
(the latter are “discrete frequency” analogues of Equations Q-Q). For this reason, the 
proof of Theorem 2.4 follows closely the proof of HSl Th. 4.1]. We just provide here the main 
steps of this proof. 

The first observation is that the mutual information per receive antenna f log(l+A)fj-T(dX) 
is related with rriT by the equation 


J log(l + A)//T(dA) = 


OO / ]^ 

- — mT{—t )) dt 


which is obtained from the relation (i[log(l + A/t)\/dt = —t ^ + (A + t) It also holds that 
f XfixidA) < 4(a^ + <T^). Indeed, getting back to the proof of Theorem 


2.1 


consider an event 

where converges weakly to ht as n —)• oo, and where limsup^ f \uJj^{d\) < 2(a^ + cr^). Let 
C* > 0 be a continuity point of ^t- Then for n large enough, 

4(a2 + cr2)> [{X A C) i^^idX) -^ [{X A C) ^iridA). 

J n^oo J 

Taking a sequence of such continuity points that converges to infinity, we obtain the result 
by the monotone convergence theorem. 

By mimicking the proof of |18[ Lemma C.l], we can also show that f Xn^idA) < a^ + cr^. 
This shows that 


Therefore, 


< \t~^ -pT{-t)\ + -mT{-t))\ 

r-oo 1 ^ 

>0 ^ 


t t X 


< t 


-2 


J XiTTidA) +t~^ J Xprp{dX) < 3(a^ + 


= / log(l + A) /ir(dA) = - Pt(-^)) dt 


is finite, and by the dominated convergence theorem, N~^It{S; (Y, H)) —Xt —)• 0 as T —?■ oo. 
In order to obtain the expression of Xt provided in the statement, we need to find an an¬ 
tiderivative for 1/t — P' 7 ’(—t). This antiderivative can be obtained by a lengthy but straight¬ 
forward adaptation of the derivations of HSl § C.2]. 


A Proof of Lemma 6.4 


Given an integer M £ N, let and Q^{z) be as in the statement of Lemma 6.3, The 


variance of Qpq{k,t) can be bounded by the PN inequality. Specifically, we have by this 
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inequality Yav Q^q{k,£) < Xi + X 2 where 
M N-1,T-1 

XI = ^ X] HXn,t{i,j)Xn,t{u,v)]E 

i,j,u,v=—M n,t=0 
M N-1,T-1 

X2 = ^ ^ E[Xn,t{i,j)Xn,tiu,v)]E 

i,j,u,v=—M n,t=0 


.dXn,tii,j) \dXn,t{u,v)). 

r^ dQ^g{kJ) ~. dQ^g{k,£) , 
\dXn,t{i,j) £ dXn,t{u,v). ■ 


Let us deal with xi- Since 

dQ^,{k,i) 


dX, 


n± 




we get 


Xi = irii - u)^Tii - 


ij,u^v n^t 


E [q^^ik, i) i)Q^n{k, u) £) 

^ ^7T(r)0r(s)^ 


r,s,w n^t 


E 


Q^nik,r + u)[H^*Q^]t,g{r + u- s,i)QM^{k,u)[H^*Q^]t,,{u - s,£) 




Using the inequality \EJ2uXuYu\ < {J2u^\Xu\‘^)^^‘^{Ylu^\Xu\‘^)^^'^, we get 
X1<^Y1 \XT{r)\ (pTisf Y1 Y1 ^\Q^nik, u)[H^*Q^]t,g{u - s, £)\^ 

r,s u n,t 

^Y.\^Tir)\Ms?Y.^[[Q^ik,u)Q^ik,ur]p,j,[[H^*Q^]iu-s,£r[H^*Q^]i 


T 

r,. 

< 

- Ty2 


u — s 




Y,\lT{r)\Y.^\[[H^*Q^] (u, £)* [H^*Q^] {u, £)],,, 


1 

< =^^E 




< 


T y2 
1 <T‘^g{\z\ +y) 
T 


The term X 2 can be treated similarly, using the second identity in the statement of Lemma 6.3 
This leads to the inequality 


VarQ^;(A:,^) < 


2 (T^g{\z\ +y) 


Now, given any vector a G /C, it is clear that H^*a —?■ HH*a strongly as M —)• oo. Since 

/C is a core for HH*, the operator }j^}j^* converges to HH* in the strong resolvent sense |34[ 
Th. VIIL25]. It results that Q^q{k,£) —>■ Qp^q{k,£) almost surely as M ^ oo. Since both 
these random variables are bounded by 1/y, we get that Yai Q^q{k,£) —)-m->-oo Yav Qp^q{k,£), 
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and the bound on Ya,iQp^g{k,£) is established. 

We now establish the bound over YaiTr Q{k,i). 
YaiTTQ^{k,£) < Xi + X2 where 

M N-1,T-1 

Xl = ^ HXn,t{i,j)Xn,t{u,v)]E 

i,j,UyV=—M n,t=0 


Using the PN inequality again, we get 


- dTrQ^jkJ) / aTrg^(fc,£) y 

- dXn,t{i,j) V dXn,t{u,v) ). 


and a similar formula for X2- By the differentiation formula, we have 


dTvQ^jkJ) 

dXn,t{i,j) 






J t^n 


Therefore, 

Xl = ^ X] - u)(t)T{i - jfSi-j^u-v 

i,j,u,v n,t 

= fEE XT{r)(l)T{sf 

r,s,u n^t 

E[[[if"*(3"|(u + r - s, t)Q’-'{k, u + r)],,„||fl-«"Q«](ti - «, f)<3"(fc, «)],,„ 

= fEE 7r(r’)</>r(s)^ 

r,s,u n,t 


E 


[H^*Q^DiJk_eQ'%Au + r 


Ml 


where and Ji are the operators on t^iE) 


s,u + r)[H^*Q^D^Jk-iQ^]t,n{u - s,u) 


[j£a](A:) = a{k + £) and \D^a\{k) = Sk/a{k) 

where a{k) (resp. [j£a]{k), [D£a]{k)) is the k^^ block with size N of the vector a (resp. J^a, 
Dia) in the canonical basis. Pursuing, we have 


XI <^^hTir)\Msf^Yl e| [H^*Q^D£Jk-eQ^]tAu - s, u) 


u n,t 


Observe now that the operator O = Q^* J^_^D£Q ^*is a hnite rank 

operator such as 

y + \z\ 


rank(C)) < N and ||C>|| < 




Therefore, 


EE| [H^*Q^DeJk-eQ%,n{u - s,u) 

u n,t 


^ < TrO < 


which shows that 


Xi < 


c(T‘^g{y + \z\) 
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and we get the same bound for X 2 - It remains to use the same argument as in the first part 
of the proof to obtain the bound on VarTrQ(k,£). 

The derivations leading to the bound on Varlff*Q]p^g(k,£) are omitted, since they are 
similar to above. 
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